Learning to predict more accurate text instances for scene text detection
نویسندگان
چکیده
At present, multi-oriented text detection methods based on deep neural network have achieved promising performances various benchmarks. Nevertheless, there are still some difficulties for arbitrary shape detection, especially a simple and proper representation of instances. In this paper, pixel-based detector is proposed to facilitate the prediction instances with shapes in manner. Firstly, alleviate influence target vertex sorting achieve direct regression instances, starting-point-independent coordinates loss proposed. Furthermore, predict more accurate instance accuracy as an assistant task refine predicted under guidance IoU. To evaluate effectiveness our detector, extensive experiments been carried public benchmarks which contain We obtain 84.8% F-measure Total-Text benchmark. The results show that method can reach state-of-the-art performance.
منابع مشابه
Aggregating Local Context for Accurate Scene Text Detection
Scene text reading continues to be of interest for many reasons including applications for the visually impaired and automatic image indexing systems. Here we propose a novel end-to-end scene text detection algorithm. First, for identifying text regions we design a novel Convolutional Neural Network (CNN) architecture that aggregates local surrounding information for cascaded, fast and accurate...
متن کاملFused Text Segmentation Networks for Multi-oriented Scene Text Detection
In this paper, we introduce a novel end-end framework for multi-oriented scene text detection from an instanceaware segmentation perspective. We present Fused Text Segmentation Networks, which combine multi-level features during feature extracting as text instance may rely on finer feature expression compared to general objects. It detects and segments the text instance jointly and simultaneous...
متن کاملText-Attentional Convolutional Neural Networks for Scene Text Detection
Recent deep learning models have demonstrated strong capabilities for classifying text and non-text components in natural images. They extract a high-level feature computed globally from a whole image component (patch), where the cluttered background information may dominate true text features in the deep representation. This leads to less discriminative power and poorer robustness. In this wor...
متن کاملEmotion Detection in Persian Text; A Machine Learning Model
This study aimed to develop a computational model for recognition of emotion in Persian text as a supervised machine learning problem. We considered Pluthchik emotion model as supervised learning criteria and Support Vector Machine (SVM) as baseline classifier. We also used NRC lexicon and contextual features as training data and components of the model. One hundred selected texts including pol...
متن کاملScene Text Detection in Video by Learning Locally and Globally
There are a variety of grand challenges for text extraction in scene videos by robots and users, e.g., heterogeneous background, varied text, nonuniform illumination, arbitrary motion and poor contrast. Most previous video text detection methods are investigated with local information, i.e., within individual frames, with limited performance. In this paper, we propose a unified tracking based t...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Neurocomputing
سال: 2021
ISSN: ['0925-2312', '1872-8286']
DOI: https://doi.org/10.1016/j.neucom.2021.04.035